AITopics | sharpness-aware minimization

Collaborating Authors

sharpness-aware minimization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Riemannian SAM: Sharpness-Aware Minimization on Riemannian Manifolds

Neural Information Processing SystemsApr-29-2026, 20:12:39 GMT

Contemporary advances in the field of deep learning have embarked upon an exploration of the underlying geometric properties of data, thus encouraging the investigation of techniques that consider general manifolds, for example, hyperbolic or orthogonal neural networks. However, the optimization algorithms for training such geometric deep models still remain highly under-explored. In this paper, we introduce Riemannian SAM by generalizing conventional Euclidean SAM to Riemannian manifolds. We successfully formulate the sharpness-aware minimization on Riemannian manifolds, leading to one of a novel instantiation, Lorentz SAM. In addition, SAM variants proposed in previous studies such as Fisher SAM can be derived as special examples under our Riemannian SAM framework. We provide the convergence analysis of Riemannian SAM under a less aggressively decaying ascent learning rate than Euclidean SAM. Our analysis serves as a theoretically sound contribution encompassing a diverse range of manifolds, also providing the guarantees for SAM variants such as Fisher SAM, whose convergence analyses are absent. Lastly, we illustrate the superiority of Riemannian SAM in terms of generalization over previous Riemannian optimization algorithms through experiments on knowledge graph completion and machine translation tasks.

artificial intelligence, gradl, machine learning, (17 more...)

Neural Information Processing Systems

Country: Europe (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification

Neural Information Processing SystemsMar-22-2026, 20:11:18 GMT

Graph Neural Networks (GNNs) have shown superior performance in node classification. However, GNNs perform poorly in the Few-Shot Node Classification (FSNC) task that requires robust generalization to make accurate predictions for unseen classes with limited labels. To tackle the challenge, we propose the integration of Sharpness-Aware Minimization (SAM)--a technique designed to enhance model generalization by finding a flat minimum of the loss landscape--into GNN training. The standard SAM approach, however, consists of two forward-backward steps in each training iteration, doubling the computational cost compared to the base optimizer (e.g., Adam). To mitigate this drawback, we introduce a novel algorithm, Fast Graph Sharpness-Aware Minimization (FGSAM), that integrates the rapid training of Multi-Layer Perceptrons (MLPs) with the superior performance of GNNs. Specifically, we utilize GNNs for parameter perturbation while employing MLPs to minimize the perturbed loss so that we can find a flat minimum with good generalization more efficiently.

artificial intelligence, machine learning, proceedings, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.59)

Add feedback

Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima

Neural Information Processing SystemsFeb-19-2026, 11:25:20 GMT

To address this gap, we study deterministic/stochastic versions of SAM with practical configurations (i.e., constant

artificial intelligence, machine learning, theorem 4, (15 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

e095c0a3717629aa5497601985bfcf0e-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 13:57:20 GMT

artificial intelligence, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Nevada (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

Riemannian SAM: Sharpness-Aware Minimization on Riemannian Manifolds

Neural Information Processing SystemsFeb-17-2026, 05:15:34 GMT

As a significant direction in this line, hyperbolic representation learning has been shown to offer several advantages over conventional Euclidean geometry.

machine learning, manifold, natural language, (17 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Europe > France (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language (0.68)

Add feedback

Optimal Transport Model Distributional Robustness Van-Anh Nguyen

Neural Information Processing SystemsFeb-11-2026, 12:17:03 GMT

SAM aims to find a perturbed model within the vicinity of a current model that maximizes the loss over a training set.

artificial intelligence, distributional robustness, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Oceania > Australia (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > Vietnam (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

EscapingSaddlePointsforEffectiveGeneralizationon Class-ImbalancedData

Neural Information Processing SystemsFeb-10-2026, 18:28:13 GMT

Several techniques based on re-weighting and margin adjustment of loss are often used toenhance theperformance ofneural networks, particularly onminority classes.

artificial intelligence, arxivpreprintarxiv, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
Asia > India > Karnataka > Bengaluru (0.04)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

FundamentalConvergenceAnalysisof Sharpness-AwareMinimization

Neural Information Processing SystemsFeb-8-2026, 14:25:26 GMT

Additionally, it is evident that the results in (ii) do not implytheconvergenceof f(xk) to0.

artificial intelligence, justification, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
Europe > Switzerland (0.04)
Asia > China (0.04)

Genre: Research Report (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

The Crucial Role of Normalization in Sharpness-Aware Minimization

Neural Information Processing SystemsDec-26-2025, 21:30:06 GMT

Sharpness-Aware Minimization (SAM) is a recently proposed gradient-based optimizer (Foret et al., ICLR 2021) that greatly improves the prediction performance of deep neural networks. Consequently, there has been a surge of interest in explaining its empirical success.

crucial role, normalization, sharpness-aware minimization, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.61)

Add feedback

Filters

Collaborating Authors

sharpness-aware minimization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Riemannian SAM: Sharpness-Aware Minimization on Riemannian Manifolds

Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification

Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima

e095c0a3717629aa5497601985bfcf0e-Supplemental-Conference.pdf

Riemannian SAM: Sharpness-Aware Minimization on Riemannian Manifolds

5bf2b802e24106064dc547ae9283bb0c-Paper-Conference.pdf

Optimal Transport Model Distributional Robustness Van-Anh Nguyen

EscapingSaddlePointsforEffectiveGeneralizationon Class-ImbalancedData

FundamentalConvergenceAnalysisof Sharpness-AwareMinimization

The Crucial Role of Normalization in Sharpness-Aware Minimization